Word Segmentation on Discovered Phone Units With Dynamic Programming and Self-Supervised Scoring

Authors

Abstract

Recent work on unsupervised speech segmentation has used self-supervised models with phone and word segmentation modules that are trained jointly. This paper instead revisits an older approach to word segmentation: bottom-up phone-like unit discovery is performed first, and symbolic word segmentation is then performed on top of the discovered units (without influencing the lower level). To do this, I propose a new unit discovery model and a new symbolic word segmentation model, and then chain the two models to segment speech. Both models use dynamic programming to minimize segment costs from a self-supervised network, with an additional duration penalty that encourages longer units. Concretely, for acoustic unit discovery, duration-penalized dynamic programming (DPDP) is used with a contrastive predictive coding model as the scoring network. For word segmentation, DPDP is applied with an autoencoding recurrent neural network as the scoring network. The two models are chained in order to segment speech. This approach gives word segmentation results comparable to state-of-the-art joint self-supervised models on an English benchmark. On French, Mandarin, German and Wolof data, it outperforms previous systems on the ZeroSpeech benchmarks. Analysis shows that the chained DPDP system segments shorter filler words well, but longer words might require some external top-down signal.
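
The duration-penalized dynamic programming idea described in the abstract can be illustrated with a short Python sketch. This is a minimal sketch under stated assumptions, not the paper's implementation: the squared-error function `sse_cost` is a toy stand-in for the self-supervised scoring network, and the linear penalty `1 - length`, the weight `dur_weight`, and the limit `max_len` are illustrative choices introduced here.

```python
from typing import Callable, List, Tuple


def dpdp_segment(
    n: int,
    segment_cost: Callable[[int, int], float],
    max_len: int,
    dur_weight: float,
) -> Tuple[List[Tuple[int, int]], float]:
    """Segment positions 0..n into contiguous spans of minimal total cost.

    Each span [start, end) costs
        segment_cost(start, end) + dur_weight * (1 - (end - start)).
    The second term is an assumed linear duration penalty: it is zero for
    a span of length 1 and increasingly negative for longer spans, so a
    larger dur_weight favours longer units.
    """
    # best[t] is the minimal cost of segmenting the first t positions.
    best = [0.0] + [float("inf")] * n
    # back[t] is the start of the last span in that optimal segmentation.
    back = [0] * (n + 1)

    for t in range(1, n + 1):
        for s in range(max(0, t - max_len), t):
            length = t - s
            cost = best[s] + segment_cost(s, t) + dur_weight * (1 - length)
            if cost < best[t]:
                best[t] = cost
                back[t] = s

    # Follow the back-pointers from the end to recover the spans.
    spans = []
    t = n
    while t > 0:
        spans.append((back[t], t))
        t = back[t]
    spans.reverse()
    return spans, best[n]


# Toy 1-D "features": two flat regions with a jump between them.
frames = [0.0, 0.1, 0.0, 5.0, 5.1, 4.9]


def sse_cost(start: int, end: int) -> float:
    """Hypothetical scoring function: squared error around the span mean."""
    seg = frames[start:end]
    mean = sum(seg) / len(seg)
    return sum((x - mean) ** 2 for x in seg)


if __name__ == "__main__":
    for weight in (0.0, 1.0):
        spans, total = dpdp_segment(len(frames), sse_cost, max_len=6, dur_weight=weight)
        print(weight, spans, round(total, 4))
```

With `dur_weight=0.0` every frame becomes its own span, because a single frame has zero squared error. With `dur_weight=1.0` the toy input is split into the two flat regions, which shows how the penalty trades scoring cost against segment length.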

Similar resources

Word and phone level acoustic confidence scoring

This paper presents a word level confidence scoring technique based on a combination of multiple features extracted from the output of a phonetic classifier. The goal of this research was to develop a robust confidence measure based strictly on acoustic information. This research focused on methods for augmenting standard log likelihood ratio techniques with additional information to improve th...

Waterloo at NTCIR-3: Using Self-supervised Word Segmentation

In this paper, we describe the system we use in the NTCIR-3 CLIR (cross language IR) task. We participate in the SLIR (single language IR) track. In our system, we use a self-supervised word-segmentation technique for Chinese information retrieval, which combines the advantages of traditional dictionary based approaches with character based approaches, while overcoming many of their shortcomings. ...

Nonparametric Bayesian Semi-supervised Word Segmentation

This paper presents a novel hybrid generative/discriminative model of word segmentation based on nonparametric Bayesian methods. Unlike ordinary discriminative word segmentation, which relies only on labeled data, our semi-supervised model also leverages a huge amount of unlabeled text to automatically learn new “words”, and further constrains them by using labeled data to segment non-standar...

Semi-supervised Chinese Word Segmentation for CLP2012

Chinese word segmentation (CWS) lays the essential foundation for Mandarin Chinese analysis. However, its performance is always limited by the identification of unknown words, especially for short texts such as microblogs. While local context is of little help in handling unknown words, global context does provide enough contextual information and could be used to guide the CWS process. Based on this moti...

Semi-supervised Chinese Word Segmentation based on Bilingual Information

This paper presents a bilingual semi-supervised Chinese word segmentation (CWS) method that leverages the natural segmenting information of English sentences. The proposed method involves learning three levels of features, namely, character-level, phrase-level and sentence-level, provided by multiple sub-models. We use a sub-model of conditional random fields (CRF) to learn monolingual grammars, ...

Journal

Journal title: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Year: 2023

ISSN: 2329-9304, 2329-9290

DOI: https://doi.org/10.1109/taslp.2022.3229264